‘vVISWa’ – A Multilingual Multi-Pose Audio Visual Database for Robust Human Computer Interaction

Similar resources

‘vVISWa’ – A Multilingual Multi-Pose Audio Visual Database for Robust Human Computer Interaction

Automatic Speech Recognition (ASR) by machine is an attractive research topic in the signal processing domain and has drawn many researchers to contribute to signal processing and pattern recognition. In recent years, there have been many advances in automatic speech reading systems, which include audio and visual speech features to recognize words under noisy conditions. The ...

Visual body pose analysis for human-computer interaction

Human-Computer Interaction (HCI) is the study of interaction between people (users) and computers. Recent advances in computing technology have driven interest in interacting with computers through means other than the traditional keyboard, mouse, or keypad. The work presented in this thesis uses computer vision to enhance HCI, by introducing novel real-time and marker-less gesture and bo...

Audio-visual intent-to-speak detection for human-computer interaction

This paper introduces a practical system that aims to detect a user's intent to speak to a computer, by considering both audio and visual cues. The whole system is designed to intuitively turn on the microphone for speech recognition without needing to click on a mouse, thus improving the human-like communication between users and computers. The first step is to detect a frontal face through a si...

A unified approach to multi-pose audio-visual ASR

The vast majority of studies in the field of audio-visual automatic speech recognition (AVASR) assume frontal images of a speaker’s face, but this cannot always be guaranteed in practice. Hence our recent research efforts have concentrated on extracting visual speech information from non-frontal faces, in particular the profile view. The introduction of additional views to an AVASR system incr...

Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction

We are exploiting the human perceptual principle of sensory integration (the joint use of audio and visual information) to improve the recognition of human activity (speech recognition, speech event detection and speaker change), intent (intent to speak) and human identity (speaker recognition), particularly in the presence of acoustic degradation due to noise and channel. In this paper, we pre...

Journal

Journal title: International Journal of Computer Applications

Year: 2016

ISSN: 0975-8887

DOI: 10.5120/ijca2016908696